Estimating divergence times in large molecular phylogenies.
Abstract
Molecular dating of species divergences has become an important means to add a temporal dimension to the Tree of Life. Increasingly large datasets encompassing greater taxonomic diversity are becoming available to generate molecular timetrees by using sophisticated methods that model rate variation among lineages. However, the practical application of these methods is challenging because of the exorbitant calculation times required by current methods for contemporary data sizes, the difficulty in correctly modeling the rate heterogeneity in highly diverse taxonomic groups, and the lack of reliable clock calibrations and their uncertainty distributions for most groups of species. Here, we present a method that estimates relative times of divergence for all branching points (nodes) in very large phylogenetic trees without assuming a specific model for lineage rate variation or specifying any clock calibrations. The method (RelTime) performed better than existing methods when applied to very large computer-simulated datasets in which evolutionary rates were varied extensively among lineages by following autocorrelated and uncorrelated models. On average, RelTime completed calculations 1,000 times faster than the fastest Bayesian method, with even greater speed differences for larger numbers of sequences. This speed and accuracy will enable molecular dating analysis of very large datasets. Relative time estimates will be useful for determining the relative ordering and spacing of speciation events, identifying lineages with significantly slower or faster evolutionary rates, diagnosing the effect of selected calibrations on absolute divergence times, and estimating absolute times of divergence when highly reliable calibration points are available.
Related articles
Dating Phylogenies with Hybrid Local Molecular Clocks
BACKGROUND: Because rates of evolution and species divergence times cannot be estimated directly from molecular data, all current dating methods require that specific assumptions be made before inferring any divergence time. These assumptions typically bear either on rates of molecular evolution (molecular clock hypothesis, local clock models) or on both rates and times (penalized likelihood, B...
Estimating diversification rates from the fossil record
Diversification rates are estimated from phylogenies, typically without fossils, except in paleontological studies. By nature, rate estimations depend heavily on the time data provided in phylogenies, which are divergence times and (when used) fossil ages. Among these temporal data, fossil ages are by far the most precisely known (divergence times are inferences calibrated with fossils). We pro...
Prospects for building large timetrees using molecular data with incomplete gene coverage among species.
Scientists are assembling sequence data sets from increasing numbers of species and genes to build comprehensive timetrees. However, data are often unavailable for some species and gene combinations, and the proportion of missing data is often large for data sets containing many genes and species. Surprisingly, there has not been a systematic analysis of the effect of the degree of sparseness o...
Dating phylogenies with sequentially sampled tips.
We develop a Bayesian Markov chain Monte Carlo (MCMC) algorithm for estimating divergence times using sequentially sampled molecular sequences. This type of data is commonly collected during viral epidemics and is sometimes available from different species in ancient DNA studies. We derive the distribution of ages of nodes in the tree under a birth-death-sequential-sampling (BDSS) model and use...
Bayesian dating of shallow phylogenies with a relaxed clock.
Bayesian methods are increasingly being used to estimate divergence times without the restrictive assumption of a global clock. Little is known about their reliability for shallow phylogenies where DNA sequence divergence is low. We analyzed both simulated and real sequences to evaluate dating methods in phylogenies with mid-late Miocene roots. A large number of data sets (5000) with 10 taxa ea...
Journal:
- Proceedings of the National Academy of Sciences of the United States of America
 
Volume 109, Issue 47
Publication date: 2012